On Maximum Margin Hierarchical Multilabel Classification
نویسندگان
چکیده
We present work in progress towards maximum margin hierarchical classification where the objects are allowed to belong to more than one category at a time. The classification hierarchy is represented as a Markov network equipped with an exponential family defined on the edges. We present a variation of the maximum margin multilabel learning framework, suited to the hierarchical classification task and allows efficient implementation via gradient-based methods. We compare the behaviour of the proposed method to the recently introduced hierarchical regularized least squares classifier as well as two SVM variants in Reuter’s news article classification. Often in hierarchical classification, the object to be classified is assumed to belong to exactly one (leaf) node in the hierarchy (c.f. [5, 2, 4]). Following [3], in this paper we consider the more general case where a single object can be classified into several categories in the hierarchy, to be specific, the multilabel is a union of partial paths in the hierarchy. For example, a news article about David and Victoria Beckham could belong to partial paths sport, football and entertainment, music but might not belong to any leaf categories such as champions league or jazz. In our setting the training data ((xi,y(xi))) m i=1 consists of pairs (x,y) of vector x ∈ R and a multilabel y ∈ {+1,−1} consisting of k microlabels. As the model class we use the exponential family
منابع مشابه
Kernel-Based Learning of Hierarchical Multilabel Classification Models
We present a kernel-based algorithm for hierarchical text classification where the documents are allowed to belong to more than one category at a time. The classification model is a variant of the Maximum Margin Markov Network framework, where the classification hierarchy is represented as a Markov tree equipped with an exponential family defined on the edges. We present an efficient optimizati...
متن کاملAdapting non-hierarchical multilabel classification methods for hierarchical multilabel classification
In most classification problems, a classifier assigns a single class to each instance and the classes form a flat (non-hierarchical) structure, without superclasses or subclasses. In hierarchical multilabel classification problems, the classes are hierarchically structured, with superclasses and subclasses, and instances can be simultaneously assigned to two or more classes at the same hierarch...
متن کاملAdaptive Large Margin Training for Multilabel Classification
Multilabel classification is a central problem in many areas of data analysis, including text and multimedia categorization, where individual data objects need to be assigned multiple labels. A key challenge in these tasks is to learn a classifier that can properly exploit label correlations without requiring exponential enumeration of label subsets during training or testing. We investigate no...
متن کاملAbstract of " Multilabel Classification over Category Taxonomies " Multilabel Classification over Category Taxonomies Finally I Want to Specially Thank My Father
of “Multilabel Classification over Category Taxonomies” by Lijuan Cai, Ph.D., Brown University, May 2008. Multilabel classification is the task of assigning a pattern to one or more classes or categories from a pre-defined set of classes. It is a crucial tool in knowledge and content management. Standard machine learning techniques such as Support Vector Machines (SVMs) and Perceptron have been...
متن کاملIIS at ImageCLEF 2015: Multi-label Classification Task
We propose an image decomposition technique that captures the structure of a scene. An image is decomposed into a matrix that represents the adjacency between the elements of the image and their distance. Images decomposed this way are then classified using a maximum margin regression (MMR) approach where the normal vector of the separating hyperplane maps the input feature vectors into the out...
متن کامل